The Parti - Game Algorithm
نویسندگان
چکیده
Parti-game is a new algorithm for learning feasible trajectories to goal regions in high dimensional continuous state-spaces. In high dimensions it is essential that learning does not plan uniformly over a state-space. Parti-game maintains a decision-tree partitioning of state-space and applies techniquesfrom game-theoryand computational geometryto eeciently and adaptively concentrate high resolution only on critical areas. The current version of the algorithm is designed to nd feasible paths or trajectories to goal regions in high dimensional spaces. Future versions will be designed to nd a solution that optimizes a real-valuedcriterion. Many simulated problems have been tested, ranging from two-dimensional to nine-dimensionalstate-spaces, including mazes, path planning, non-linear dynamics, and planar snake robots in restricted spaces. In all cases, a good solution is found in less than ten trials and a few minutes. is a promising method for robots to program and improve themselves. This paper addresses one of reinforcement learning's biggest stumbling blocks: the curse of dimensionality 5], in which costs increase exponentially with the number of state variables. These costs include both the computational eeort required for planning and the physical amount of data that the control system must gather. The curse of dimensionality is also a problem in other areas of artiicial intelligence, such as planning and supervised learning. Much work has been performed with discrete state-spaces: in particular a class of Markov decision tasks known as grid worlds 29], 28]. Most potentially useful applications of reinforcement learning, however, take place in multidimensionalcon-tinuous state-spaces. The obvious way to transform such state-spaces into discrete problems involves quantizing them: partitioning the state-space into a multidimen-sional grid, and treating each box within the grid as an atomic object. Although this can be eeective (see, for instance, the pole balancing experiments of 19], 4]), the naive grid approach has a number of dangers which will be detailed in this paper.
منابع مشابه
Robust, Efficient, Globally-Optimized Reinforcement Learning with the Parti-Game Algorithm
Parti-game (Moore 1994a; Moore 1994b; Moore and Atkeson 1995) is a reinforcement learning (RL) algorithm that has a lot of promise in overcoming the curse of dimensionality that can plague RL algorithms when applied to high-dimensional problems. In this paper we introduce modifications to the algorithm that further improve its performance and robustness. In addition, while parti-game solutions ...
متن کاملModifying the Parti-game Algorithm for Increased Robustness, Higher Eeciency and Better Policies
Parti-game (Moore 1994a; Moore 1994b; Moore and Atkeson 1995) is a reinforcement learning (RL) algorithm that has a lot of promise in overcoming the curse of dimensionality (Bellman 1957) that can plague RL algorithms when applied to high-dimensional problems. In this paper we introduce modiications to the algorithm that further improve its performance and robustness. In addition, while parti-g...
متن کاملSpeeding up the Parti-Game Algorithm
In this paper, we introduce an efficient replanning algorithm for nondeterministic domains, namely what we believe to be the first incremental heuristic minimax search algorithm. We apply it to the dynamic discretization of continuous domains, resulting in an efficient implementation of the parti-game reinforcement-learning algorithm for control in high-dimensional domains.
متن کاملEfficient Reinforcement Learning in Continuous Environments
Reinforcement Learning (RL) is a machine learning paradigm with which autonomous agents can improve their behavior in unknown environments based on their own experience without an explicit teacher signal. RL algorithms are based on estimating a value function over the state space, and scaling them to large state spaces remains a challenge. One approach, known as variable resolution, is to focus...
متن کاملNavigation of a Kinematically Restricted Wheelchair by the Parti{game Algorithm
This paper describes a technique of adaptive state{space partitioning for mobile robot navigation. Following this approach, the controller of a wheelchair robot adaptively discretises its state{space and learns which partitions are connected and which are not. Due to the wheelchair's kinematic restrictions, the occuring problems diier from those of holo-nomic research robots. Several successful...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1995